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Abstract 

In this paper, we study the wireline two-unicast-Z communication network over directed 
acyclic graphs. The two-unicast-Z network is a two-unicast network where the destination 
intending to decode the second message has apriori side information of the first message. We 
make three contributions in this paper: 

1. We describe a new linear network coding algorithm for two-unicast-Z networks over di¬ 
rected acyclic graphs. Our approach includes the idea of interference alignment as one of 
its key ingredients. For graphs of a bounded degree, our algorithm has linear complexity 
in terms of the number of vertices, and polynomial complexity in terms of the number of 
edges. 

2. We prove that our algorithm achieves the rate-pair (1,1) whenever it is feasible in the 
network. Our proof serves as an alternative, albeit restricted to two-unicast-Z networks 
over directed acyclic graphs, to an earlier result of Wang et. al. which studied necessary 
and sufficient conditions for feasibility of the rate pair (1,1) in two-unicast networks. 

3. We provide a new proof of the classical max-flow min-cut theorem for directed acyclic 
graphs. 


‘This material is based upon work supported by the Air Force Office of Scientific Research (AFOSR) under award No. FA9550- 
13-1-0023. 

Uhis work was also supported by Viveck R. Cadambe's startup grant provided by the Department of Electrical Engineering at 
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1 Introduction 


Since the advent of network coding [I], characterizing the capacity region of networks of orthogonal 
noiseless capacitated links, often termed the network coding capacity, has been an active area of 
research. Inspired by the success of linear network coding for multicast networks [2,3], a significant 
body of work has been devoted to understanding the design and performance limits of linear 
network codes even for non-multicast communication scenarios. Previous approaches to linear 
network code design for non-multicast settings have at least one of two drawbacks: the network 
code design is restricted to a limited set of network topologies, or the approach has a prohibitive 
computational complexity. The goal of our paper is develop ideas and algorithms that fill this gap 
in literature. The main contribution of our paper is the development of a low-complexity linear 
network coding algorithm for two-unicast-Z networks - a network communication setting with two 
independent message sources and two corresponding destination nodes, where one destination has 
a priori knowledge of the undesired message (See Fig. [I]). We begin with a brief survey of previous, 
related literature. 


1.1 Related Work 


The simplest linear network code is in fact the technique of routing, which is used to show the 
max-flow min-cut theorem. In addition to the single-source single-destination setting of the max- 
flow min-cut theorem, routing has been shown to be optimal for several classes of networks with 
multiple source messages in 14 6 The technique of random linear network coding, which is 

is also shown to achieve capacity for certain non-multicast 

networks 


optimal for multicast networks 
1 ()[ 


The ideas of random linear network coding and routing have been combined 


to develop network coding algorithms for arbitrary networks in [11] . Nonetheless, it is well known 
that the techniques of random linear network coding and routing are, in general, sub-optimal linear 
network codes for networks with multiple sources. 

pessimistic results related to the performance 
Through deep connections between linear 


12-14 


To contrast the optimistic results of 
limits of linear network codes have been shown in 
network coding and matroidal theory, it has been shown that linear network codes are sub-optimal 
in general |12[[l5 |. In fact, most recently |14|, it has been shown that even the best linear code 
cannot, in general, achieve the capacity of the two-unicast network. 

Literature has also studied the computational limits of construction and performance evalua¬ 
tion of linear codes. It is shown in |16|| that characterizing the set of rates achievable by scalar 
linear code^jis NP-complete. Furthermore, for a given field size, the computational complexity of 
determining the best (scalar or vector) linear code is associated with challenging open problems 
related to polynomial solvability [18] and graph coloring 19 . Nonetheless, characterization of the 


computational complexity of determining the set of all rates achievable by linear network coding 
remains an open problem. One main challenge in resolving this question is to develop ideas and 
algorithms for constructing linear network codes for non-multicast settings. The study and devel- 


1 In fact, in [TJ, routing has been conjectured to achieve network capacity for multiple unicast networks over undirected directed 
acyclic graphs. 

2 Linear network coding techniques which do not necessarily involve choosing co-efficients randomly have also been studied for 
multicast and certain non-multicast settings in 00 

3 We use the term linear coding for network codes where all the codewords come from a vector space, and encoding functions 
are linear operators over this vector space. The term scalar linear network coding is used when the dimensionality of this vector 
space is equal to 1. It is known that vector linear network coding strictly outperforms scalar linear network coding in general G3- 
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opment of linear network codes for non-multicast settings is also important from an engineering 
standpoint, especially in current times, because its utility has been recently demonstrated in set¬ 
tings that arise from modeling distributed storage systems [201, wireless networks |2l|, and content 
disribution systems 1221. We review the principal approaches to developing linear network codes 
outside the realm of routing and random linear network coding. 

Network Codes over the binary field: Initial approaches to developing codes for non¬ 
multicast settings restricted their attention to codes over the binary field. Since the binary field 
provides a small set of choices in terms of the linear combinations that can be obtained by an 
encoding node, it is possible to search over the set of all coding solutions relatively efficiently. This 
idea was exploited to develop a linear programming based network coding algorithms in 123,24'. 
Reference 25 presents a noteworthy result that demonstrates the power of carefully designed 


network codes over the binary field. Through a careful understanding of the network communication 
graph, [25 characterized the feasibility of rate tuple (1,1) in two unicast networks. The result of 125 
can be interpreted as follows: the rate tuple (1,1) is achievable if and only if (i) the min-cut between 
each source and its respective destination is at least equal to 1, and (ii) the generalized network 
sharing bound - a network capacity outer bound formulated in 26] - is at least 2. Furthermore, 
the rate (1,1) is feasible if and only if it is feasible through linear network coding over the binary 
field. In general, however, coding over the binary field does not suffice even for the two unicast 
network |26|. 

Interference Alignment: Interference alignment, which is a technique discovered in the 
context of interference management for wireless networks, has recently emerged as a promising tool 
for linear network code design even for wireline networks. Specifically, an important goal in the 
design of coding co-efficients for non-multicast networks is to ensure that an unwanted message 
does not corrupt a desired message at a destination. The unwanted message can be interpreted as 
an interferer at the destination, and thus, tools from interference management in wireless networks 
can be inherited into the network coding setting. 

Interference alignment was first explicitly identified as a tool for multiple unicast network coding 



problem and the index coding problem. 

Index Coding Based Approaches: The equivalence of the general network coding problem 
and the index coding problem, which is established in [36,37], opens another door to the develop¬ 
ment of linear network codes for general networks. Specifically, for a given network coding setting, 
the approach of 37 can be used first to obtain an equivalent index coding setting; then the ap¬ 
proaches of [38-41 can be used to develop linear index codes. However, these index code design 


approaches have high computational complexity, since they require solving challenging graph color¬ 
ing related problems or linear programs whose number of constraints is exponential in terms of the 
number of users. Because of the nature of the mapping of 37], this means that these approaches 
require solving linear programs where the number of constraints is exponential number in terms 
of the number of edges of the network in consideration. Another common approach to obtaining 
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Figure 1: The Two-unicast-Z Network 


index coding solutions is given in 42 , which connects the rate achievable via linear index coding 


to a graph functional known as minrank. While in principle, the minrank characterizes the rate 
achievable by the best possible linear index code, the min-rank of a matrix over a given field size 
is difficult to evaluate, and NP-Hard in general 43 ; furthermore, there is no systematic approach 
to characterizing the field size. 

In summary, while recent ideas of interference alignment and connections to index coding 
broaden the scope of linear network coding, these approaches inherit the main drawbacks of linear 
network coding. Specifically, network coding approaches outside the realm of routing or random 
linear network coding are either carefully hand-crafted for a restricted set of network topologies, or 
their enormous computational complexity inhibits their utility. The motivation of our paper is to 
partially fill this gap in literature by devising algorithms for linear network coding. We review our 
contributions next. 


1.2 Contributions 

The goal of this paper is to devise systematic algorithms for linear network coding that incorpo¬ 
rate ideas from interference alignment. In this paper, we focus on two-unicast-Z networks over 
directed acyclic graphs. The two-unicast-Z network communication problem consists of two sources 
si,S 2 , two destinations t\,t 2 and two independent messages W\. W 2 . Message W % is generated by 
source Si and is intended to be decoded by destination tj. In the two-unicast-Z setting, destination 
t 2 has apriori side information of the message W\. Our nomenclature is inspired from the Z- 
interference channel |44| in wireless communications, where, like our network, only one destination 
faces interference]^] In fact, with linear coding, relation between the sources and destinations in the 
two-unicast-Z setting is the same as a Z-interference channel. We note that the two-unicast-Z net¬ 
work can be interpreted as a two unicast network with an infinitely capacitated link between source 
1 and destination 2. Therefore two-unicast-Z networks form a subclass of two-unicast networks. 

4 It is perhaps tempting to consider a network where si is not connected to and use the nomenclature of the two-unicast-Z 
for such a network. However, it is worth noting that routing is optimal for such a network and each message achieves a rate equal 
to the min-cut between the respective source and destination. Therefore, the study of such a network is not particularly interesting 
from a network capacity viewpoint. 
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Despite the simplicity of its formulation, little is known about the two-unicast-Z network. For 
instance, while the generalized network sharing (GNS) bound - a network capacity outerbound 
formulated in |26| - is loose in general for two-unicast networks, we are not aware of any two-unicast- 
Z network where the GNS bound is loose. Similarly, while linear network coding is insufficient for 
two-unicast, it is not known whether linear network coding suffices for two-unicast-Z networks. In 
this paper, we use two-unicast-Z networks over directed acyclic graphs as a framework to explore 
our ideas of linear network coding. We make three main contributions in this paper: 


( 1 ) 


(2) 


In Section [4j we describe an algorithm that obtains linear network codes for two-unicast-Z 
networks over directed acyclic graphs. Our approach is based on designing coding co-efficients 


to maximize the capacity of the implied end-to-end Z-interference channels in the network 45 


For graphs whose degree is bounded by some parameter, the complexity of our algorithm is 
linear in the number of vertices and polynomial in the number of edges. We provide a high 
level intuitive description of our algorithm in Section [2j 

In Section [5j we provide an alternate proof of the classical max-flow min-cut theorem for 
directed acyclic graphs. Our proof therefore adds to the previous literature that has uncovered 
several proofs of the theorem 1,46,47 . Our proof is a direct, linear coding based proof, and 
relies on tools from elementary linear algebra. This is in contrast to previous proofs which 
rely on graph theoretic results (Menger’s theorem) or linear programming. This proof, in fact, 
inspires our algorithm for two-unicast-Z networks. We provide an intuitive description of the 
proof in Section [2] 


(3) In Section[6j we prove that our algorithm achieves a rate of (1,1) whenever it is feasible. Note 
that the necessary and sufficient conditions for the feasibility of the rate pair (1,1) has been 
characterized in 1251. Our proof in Section [6] provides an alternate proof of the feasibility of 
the rate pair (1,1), when restricted to two-unicast-Z networks over directed acyclic graphs. 
Like our alternate proof to the max-flow min-cut theorem, our proof of Section [6] also relies 
significantly on elementary linear algebra. 


2 Intuition Behind the algorithm 

Consider the two-unicast-Z network described in Fig. [lj In this network, with linear coding at all 
the nodes in the network, the input output relationships can be represented as 


Yi = X 1 H 1 + X 2 H 2 , Y 2 = X 2 G 2 , 


where for i £ {1, 2}, X.; is a row vector representing the input symbols on the outgoing edges of the 
rth source, Y.; is row vector representing the symbols on the incoming edges of the zth destination 
node followed by interference cancellation with the side information if i = 2. Note that if Hi, H 2 
are fixed and known, the input-output relations are essentially akin to the Z-interference channel. 
Using the ideas of El Gamal and Costa 48] for the interference channel, the set of achievable rate 
pairs (R .\, /?, 2 ) can be described (see 145j 49,501) as the rate tuples (i?i,i? 2 ) satisfying 


Ri < rank (Hi), 


i?i + i? 2 < rank 


Hi 

H 2 


i? 2 < rank (Go) 

+ rank([H 2 G 2 ]) 


rank(H 2 ) 


( 1 ) 

( 2 ) 
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The goal of our algorithm is to specify the linear coding co-efficients at all the nodes in the 
network, which in turn specifies the matrices Hi,H 2 ,G 2 - Once these matrices are specified, the 
rate region achieved in the network is specified by <§• Here, we describe our approach to 
designing the linear coding co-efficients in the two-unicast-Z network. 

To describe our intuition, we begin with the familiar single source setting, and describe the ideas 
of our algorithm restricted to this setting. Note that with a single source and single destination, 
with linear coding in the network, the end-to-end relationship can be represented as Y = XH, 
where X and Y respectively represent the symbols carried by the source and destination edges, 
and H represents the transfer matrix from the source to the destination edges. We know from 
classical results that the linear coding co-efficients can be chosen such that the rank of H is equal 
to the min-cut of the network. Here, we provide an alternate perspective of this classical result. 
Our examination of the single-source setting provides a template for our algorithm for the two- 
unicast-Z network which is formally described in Section |4j Our approach also yields an alternate 
proof for the max-flow min-cut theorem which is provided in Section [5] 


2.1 Algorithm for Single-Unicast Network 

We focus on a scenario shown in Fig. [2j Denote the network communication graph Q = (V,£), 
where V denotes the set of vertices and £ denotes the set of edges. Now, suppose that, as shown 
in Figure [ 2 J a linear coding solution has been formulated for Q = (V,£ — {e}), where e denotes an 
edge coming into the destination node. The question of interest here is the following: How do we 
encode the edge e so that the end-to-end rate is maximized? We assume that our coding strategy 
is restricted to linear schemes. 

Let X be a 1 x S vector denoting the source symbols input on the S edges emanating from the 
source node. Let H denote a S x (D — 1) linear transform between the input and D — 1 destination 
edges - all the destination edges excluding edge e. Let pi, P 2 , • • •, Pk be 1 x S vectors respectively 
denoting the linear transform between the source and the k edges coming into edge e, that is, the 
zth incoming edge. Now, our goal is to design the coding strategy for edge e, that is, to choose 
scalars aq, 0:2 ,&k such that the rate of the system 


Y = X 


H 


k 

i=l 


is maximized, given matrix H and vectors pi, P 2 , ..., 
ai, « 2 , ■ • •, cy-k such that the rank of H Yli=i a jPj 
is quite straightforward - one can notice that if 


Pa,. Equivalently, the goal is to choose scalars 
is maximized. The solution to this problem 


rank ([H pi p 2 


Pfc]) > rank (H) 


H Hi=i^jPj 


(3) 

is equal to the 


then the scalars a\,... ,ak can be chosen such that the rank of 
rank (H) + 1. Since rank(H) is the rate obtained by the destination if edge e is ignored, the 
implication is that if 0 is satisfied, then, we can design a linear coding strategy such that edge 
e provides one additional dimension to the destination. In fact, if 0 is satisfied and the field 


of operation is sufficiently large, then choosing the scalars randomly, uniform 


the field of operation and independent of each other increases the rank of 
implying the existence a linear coding solution. 




y over 
by 1, 
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Figure 2: A single-unicast scenario depicted pictorially. The goal is to find scalars a .%,..., 07 ,.. 


A solution to the scenario of Fig. [2] naturally suggests a linear coding algorithm for the single 
unicast problem. Suppose we are given a directed acyclic graph Q = (V,£), a set of source edges 
S C £, a set of destination edges D C £. Our strategy removes the last topologically ordered 
edge e 6 D and finds a linear coding solution for the remaining graph. That is, specifically, we 
develop a linear coding solution for Q = (V,£ — {e}) , with source edges S and destination edges 
D — {e} U In(u), where v represents the tail node of edge e, and In(u) represents the set of edges 
incoming on to edge v. Therefore, we have reduced our original problem, which intended to design 
coding co-efficients for \£\ edges, to one which needs to design coding co-efficients for \£\ — 1 edges, 
albeit with a slightly different set of destination edges in mind. We can now recursively iterate the 
same procedure to this smaller problem, removing the last edge as per topological ordering at each 
iteration and modifying the destination edge set accordingly until all the edges are removed except 
the source edges. A trivial coding solution applies to this graph, which forms a starting point for 
the recursive algorithm we have described. 

While our insight might appear superfluous in the context of the single unicast setting, it 
does lead to an alternate proof for the max-flow min-cut theorem for directed acyclic graphs. To 
conclude our discussion, we provide an intuitive description of the proof; the proof is formally 
provided in Section [5} In our proof, we make the inductive assumption that the max-flow min-cut 
theorem is valid for the source S and for any destination set which is a subset of £ — {e}. Under 
this assumption, we show using ideas from classical multicast network coding literature that the 
optimal linear coding solutions for the two possible destination sets D — {e} and D — (e}Uln(u) can 
be combined into a single linear coding solution that simultaneously obtains the min-cut for both 
destination sets. Then, we use this combined solution along with the solution to Fig. [2] and show 
that this linear coding solution achieves a rate equal to the min-cut for destination set D. More 
specifically, we show that if the edge e belongs to a min-cut for destination D , then the inductive 
assumption implies that, for this combined solution, ([ 3 ]) holds; our strategy of choosing coding 
co-efficients randomly over the field ensures that a rank that is equal to the min-cut is achieved for 
graph Q with destination D as well. 

2.2 Algorithm for Two-Unicast-Z Networks 

Consider a two-unicast-Z network of the form shown in Fig. [3j The graph Q = (V,£) consists of 
two sets of source edges Si, 52, two sets of destination edges Ti,T 2 , with the destination 2 being 
aware of the message of the first source apriori. Now, consider a situation where a coding solution 
has been formulated for all the edges of the graph, with the exception of edge e E D\. We are 
interested in understanding how to encode the edge e so that the end-to-end rate is maximized. 
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Figure 3: A two-unicast-Z scenario depicted pictorially. The goal is to find scalars aa,..., a*, to maximize Q. 

Our heuristic is based on maximizing the sum-rate that is, the right hand side of equation (Hi- 
Based on Fig. [3j our goal is to find oq, « 2 , • • •, ctfc such that 


rank 


H 1 J2i =1 a iPi,l 

H 2 Yli= 1 a * P *,2 


+ rank 




CtiPi,2 



rank 


H 2 J2i =1 a iPi,2 


(4) 


is maximized. To do so, we examine two cases: 


Case 1 If pi 2, ..., p/c,2 He in the span of Hq, then, clearly, choosing oqs randomly and uniformly 
over the field is the best strategy with a probability that tends to 1 as the field size increases. 
This is because in the third and negative term in Q, the column corresponding to the random 
linear combination of pj^’s do not contribute to the rank of the matrix, while random linear 
coding maximizes the first two terms with high probability. 

Case 2 If pi 2 ,..., Pfc ,2 does not lie in the span of Hq, then the solution is a bit more involved. 
We divide this case into two sub-cases 


Case 2a Suppose that pi, 2 ,..., Pk .2 do not lie in the span of [H 2 g 2 ], then chosing oqs ran¬ 
domly maximizes the expression of ©■ In particular, we note that choosing aqs randomly 
increases the two positive terms and the negative term of (4]) by 1, effectively increasing 


the the expression of (4) by 1, as compared with rank 


Hi 

H 2 


+ rank ([H 2 G 2 ]) — 

rank (Ho). We later show in Lemma [T] that the expression of Q cannot be increased by 
more than 1; this implies the optimality of the random coding approach for the case in 
consideration here. 


Case 2b Suppose that pi, 2 , ..., Pk ,2 lies in the span of [H 2 G 2 ] , but does not lie in the 

span of H 2 . In this case, the optimal strategy is to choose co-efficients aq,o: 2 , ■ ■ ■ ,otk 
so that Yli=i criPi ,2 is a random vector in the intersection of the column spaces of H 2 
and [pi ,2 P 2.2 • • • Pfc,2] • ! n other words, we intend to align the local coding vector 

on edge e in the space 0 /H 2 . 


From the above discussion, it is interesting to note that we naturally uncover interference alignment 
in Case 2b as a technique that maximizes the expression of ([4]). In fact, we show later in Lemma 












[2] that, if 


rank 


( 


Hi Pi,! p 2 .i • • ■ Pfc,l 
V LH 2 Pi,2 P2,2 ■ • • Pfc,2 


+ rank ([H 2 Pi ,2 P 2,2 • • • Pfc ,2 G 2 ]) 

—rank ([H 2 pi, 2 Pi ,2 ••• Pfc, 2 ]) 


> rank 


Hi 

H 2 


+ rank ([H 2 G 2 ]) - rank (H 2 ) 


then, our choice of ai, a 2 , ■ ■ ■ ,&k increases (4) by 1 as compared with rank ^ ^ +rank ([H 2 


rank (H 2 


(5) 

g 2 ])- 



Figure 4: A two-unicast-Z scenario depicted pictorially. The goal is to find scalars o- |,..., o/, : to maximize (Jdji. 


Now consider the scenario of Fig. [4j In this scenario, the goal of maximizing the right hand side 


of (|2]) is tantamount to choosing scalars cti, « 2 , •••, ctfc to maximize rank ^ H 2 G 2 Si=i a *P*,2 
The scenario is similar to the single-unicast problem discussed earlier, and choosing the scalars a . 
randomly, uniformly over the field of operation and independent of each other maximizes the sum- 
rate. In fact, it is easy to see that this strategy improves the sum-rate by 1, if 


rank 
> rank 


^ ^ ^ + rank ([H 2 G 2 pi, 2 P 2,2 • • • Pfc, 2 ]) - rank (H 2 

+ rank ([H 2 G 2 ]) - rank (H 2 ). 


Hi 

H 2 


( 6 ) 

(V) 


Remark 1. Strictly speaking, the sum-rate of the two-unicast-Z network is equal to the minimum of 

k 


rank 


«. £ 


oie pe,i 


1 =1 


+ rank( G 2 ) 


and the expression o/(|4]). In this paper, for the sake of simplicity, we restrict our attention to maximizing the 
expression of ©; our ideas and algorithms can be easily modified to maximize the smaller of 

k 


rank 


Hi ^2 °TPl 


1=1 


+ rank( Gf) 


and Q. 
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To summarize, we observe from © and 0 that an edge e can increase the right hand side of 
0 by 1 if, the parent edges of the edge e in combination to the other already existing edges in 
the destination can increase the right hand side of 0 by at least 1. Furthermore, our strategy 
to maximize the right hand side of Q automatically uncovers the idea of alignment. Before 
proceeding, we briefly explain how the problems of Fig. [3] and [4] can be composed naturally into a 
recursive algorithm to design linear coding co-efficients for the entire network. 

We represent the network as a directed acyclic graph Q = (V,£), where V denotes the set of 
vertices, and £ denotes the set of edges. We begin with the last edge of the graph, that is the 
edge e with the highest topological order in the graph. This edge e is incident on either the first 
destination or the second destination. Given a coding solution for the graph Gi = (V, £ — {e}), 
we can design a linear coding solution based on the above approach. Therefore, we aim to design 
a coding solution for the smaller graph Gi- To do this, we add all the parent edges of e to the 
corresponding destination. That is, if edge e is in destination 1, modify destination 1 in Gi to 
include all the edges incoming on to the vertex v, where v is the vertex from which edge e emanates 
(See Figs. 0. Similarly, if edge e is incident onto destination 2, we remove edge e to reduce the 
problem to a smaller graph Gi and all the parent edges of e to destination 2 (See Fig. 0. Now, 
our goal is to find a linear coding solution to the smaller problem Q\. We proceed similarly by 
identifying the last topologically ordered edge in Gi and removing it to obtain G 2 , and further 
modifying the destinations. If we proceed similarly, removing one edge at a time from graph G, we 
obtain a sequence of graphs Gi, G 2 , ■ ■ ■, to eventually obtain a graph Gn where the destination edges 
coincide with the source edges. Starting with a trivial coding solution for Gn, we build a coding 
solution for the sequence of graphs Gn,Qn- i, ■ • • ,Gi and eventually obtain a coding solution for 
graph Q. Our approach is formally outlined in Section 0 Before we proceed, we note that in our 
sequence of graphs obtained above, it can transpire that the last topologically ordered edge in one 
of the graphs belongs to both destinations. We omit an explanation of this scenario here, since our 
approach in handling this scenario is similar to the one depicted in Fig. 0 

3 System Model 

Consider a directed acyclic graph (DAG) Q = (V,£), where V denotes the set of vertices and £ 
denotes the set of edges. We allow multiple edges between vertices, hence, £ C V x V x Z + , where 
Z + denotes the set of positive integers. For an edge e = (u, v, i ) £ £, we denote Head(e) = v and 
Tail(e) = u ; in other words, when the direction of the edge is denoted by an arrow, the vertex at the 
arrow head is the head vertex, and the vertex at the tail of the arrow is the tail vertex of the edge. 
When there is only one edge between node u and node v, we simply denote the edge as (u,v). For 
a given vertex v £ V, we denote In(v) = {e £ £ : Head(e) = v} and Out('u) = {e £ £ : Tail(e) = u}. 

In this paper, we focus on the networks with one or more unicast sessions. We define each source 
as a node in V, while each destination as a subset of edges in £. Subsequently, a single unicast 
network problem D can be specified by a 3-tuple (G, s, T ), where G = (V, £) is the underlying graph, 
s £ V is the source node and T C £ is set of destination edges. Every node in the graph represents 
an encoding node, and every edge in the graph represents an orthogonal, delay-free, link of unit 
capacity. 

In two-unicast-Z networks, we use the set S = {si,S 2 } ; where si,S 2 £ V, to denote set of two 
sources. We use T = to denote the set of two destinations, where T\ and X 2 each is a 

set of edges, i.e., T) C £,i = 1,2. To keep the scenario general, we allow an edge to belong both 
destinations, i.e., Ti n T 2 need not be the null set. Furthermore, the head vertices for edges in 
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the same destination may not have to be the same, i.e., for ei,e 2 £ Tj,i = 1,2 it is possible that 
Head(ei) 7 ^ Head(e 2 ). Without loss of generality, we consider graphs where the edges with the 
highest topological order belongs to T\ U T 2 . 

In the two-unicast-Z network, the sources si and s 2 generate independent messages W\ and W 2 
respectively. The message W\ is available a priori to destination T- 2 - The goal of the two-unicast-Z 
network is to design encoding functions at every node in the network and decoding functions such 
that W\ is recoverable from the symbols carried by the edges in T\ , and W 2 is recoverable from the 
symbols carried by the edges in T 2 and the side information W\. Without loss of generality, assume 
that Si communicates with at least one edge in 2} for i £ {1,2}. Similar to the case of single unicast, 
we can denote a two-unicast-Z network coding problem Q using a 3-tuple, i.e. 12 = (G, S, T), where 
S = {si, S 2 } , T = {Ti, T 2 }. 

A rate pair (, -R 2 ) is achievable if for every e > 0 ,5 > 0, there exists a coding scheme which 
encodes message W % at a rate Ri — 6i, for some 0 < 5i < 5, such that the average decoding error 
probability is smaller than e. The capacity region is the closure of the set of all achievable rate 
pairs. 

Topological Order 

Since the graphs considered in this paper are directed acyclic graphs, there exists a standard 
topological order Ordy on the set of vertices V of the graph. The order Ordy satisfies the following 
property: if there is an edge (u, v, i ) £ £, then, Ordy (u) < Ordy(u). We define a partial order Ord^ 
on the set of edge £ such that the order of an edge is equal to the order of the tail node of the 
edge, i.e., Ord,f (e) = Ordf (Tail(e)), e £ £. Note that all edges sharing the same tail node have the 
same order. When there is no ambiguity, we omit the subscript in the ordering and simply denote 
the ordering on the edges (or vertices) as Ord. 

Linear Network Coding 

In this paper, we consider scalar linear coding, where the encoded symbol along each edge is an 
element of a finite held F. We use the algebraic framework of linear network coding of [2| to relate 
the linear coding co-efficients at the vertices of the graph to the encoded symbols carried by the 
edges. Specifically, we describe a linear coding solution for a network using a local coding matrix 
F, whose i-th column corresponds to the edge with topological order i and stores the local coding 
vector on the edge from the symbols carried by its parent edges. The linear transfer matrix of the 
entire network therefore given by M = (I — F ) _1 and the transfer matrices between sources and 
destinations can be obtained as submatrices of the network transfer matrix M. 

For i £ {1,2}, we denote the symbols carried by the edges emanating from source i by the 
1 x |Out(sj)| vector X,;. Similarly, we denote the symbols carried by the edges in 7} to be the 
lx |Tj| vector Y,;. For a linear coding scheme, we write 

Yi = X 1 Hi+X 2 H 2 (8) 

Y 2 = XiG 1 +X 2 G 2 , (9) 

where H, is the |Out(sj)| x |Ti| transfer matrix from Xj to Yi and Gj is the |Out(sj)| x |T 2 | transfer 
matrix from Xj to Y 2 . Note that the matrices H, and Gj are sub-matrices of the network transfer 
matrix M by selecting the rows and columns corresponding to the specified source and destination 
edges respectively. 
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For the two-unicast-Z network, the rate pair (iii,.R 2 ) achieved by a linear coding scheme is 
characterized by 


R\ < rank (Hi), 


Ri + i ?2 < rank 


Hi 

H 2 


i ?2 < rank (G 2 ) , 

+ rank ([H 2 G 2 ]) - rank (H 2 ) 


( 10 ) 

( 11 ) 


Note that the rate region does not depend on the matrix Gi since destination 2 cancels the effect 
of XiGi using its side information. The rate region can be derived as a simple corollary of the 
result of 48 , which obtains the capacity of a class of deterministic 2-user interference channels. 
We refer the reader to |45 for a proof. 


Notations 

The cardinality of a set E is denoted by \E\. For sets A and B , A\B denotes the set of elements 
in A but not in B. For a matrix A, colspan(A) denotes its column span and Ker(A) denotes the 
nullspace of colspan(A). In a graph Q = (V, £), For u, v E V, u v indicates that u communicates 
with v, i.e. there is a path from u to v on Q. u -/+ v means that u does not communicates with 
v on Q. For i,j E £, i j is equivalent to Out(i) In(j). We denote sub-matrices of the local 
and global coding matrices by super-scripts. Specifically, for two edge sets £\. £2 C £. the matrices 
F^ 1 ’^ 2 and M^ 1 ^ 2 respectively represent \£\\ x |T 2 | dimensional sub-matrices of F and M derived 
from the rows corresponding to £\ and columns corresponding to £ 2 . For example, we can write 
Hi = M 0ut ( Sl )’ Tl . 

In the context of the two-unicast-Z networks, a GNS-cut set as defined in |26| is a set Q E £, 
such that Q is 

1 . a si — Ti cut-set, and, 

2. a s 2 — T 2 cut-set, and, 

3. a s 2 — Ti cut-set. 

The GNS-cut set bound of a two-unicast-Z network is defined to be the cardinality of the smallest 
GNS set in the network. It is shown in 126) that the GNS-cut set bound is an information theoretic 
upper bound on the sum-rate achievable in the network. 


4 Recursive, alignment-based, linear network coding algorithm 


In this section, we present a scalar linear network code construction for the two-unicast-Z network 
problem. The algorithm consists of two sub-routines, the destination reduction algorithm which 


is described in Section 4.2, and the recursive code construction which is described in Section 4.3 


When both of them are run, the recursive coding routine returns the coding matrix F. The rate 
achieved can be obtained via ( fl0| ), The sub-routines are pictorially depicted in Fig. [6] for 
the network in Fig. [5j Before we describe these sub-routines, we begin with some preliminary 
definitions and lemmas that will be useful in the algorithm description. 
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4.1 Preliminaries 

Definition 1 (Grank). Given matrices Hi, Ho, G 2 of dimensions P\ x Q \, P 2 x Q 1 and If x respectively, 
where P\, If ,Qi, Q'l are positive integers, the Grank is defined as 


Grank( Hi, H 2 , G 2 ) = rank 



+ rank ([H 2 


g 2 ]) 


rank (H 2 ) • 


Note that the Grank is related to the sum-rate of the two-unicast-Z network where Hi, H 2 and 
G 2 respectively represent the transfer matrices between source 1 and destination 1, source 2 and 
destination 1, and source 2 and destination 2. 


Remark 2. We can show that 


Grank(Hi, H 2 , G 2 ) = min rank ^ . 

GiGF p i x< 52 I /* 2 

Our use of the term “Grank” is inspired by the above observation which indicates the quantity of interest 
is closely related to the rank of an appropriate matrix. 

Remark 3. We have shown in [ \45 1 /, that, if H 1 . H 2 and G 2 respectively represent the transfer matri¬ 
ces between source 1 and destination 1, source 2 and destination 1, and source 2 and destination 2, then 
Grank( Hi, H 2 , G 2 ) is upper bounded by minimum generalized network sharing cut value of the network. 

We state some useful properties of the Grank next. 

Lemma 1. Let Hi, H 2 , G 2 , A, B and C be matrices with entries from a finite field F, respectively having di¬ 
mensions P±xQi, P 2 xQ\. P 2 X Q 2 ■ P\ x M. P '2 x M and P 2 X N, for positive integers Pi • P 2 ,Qi, Q‘i ■ M, N. 
Then the following properties hold. 


(i) Concatenation of columns to matrices does not reduce Grank. 


Grank ([ Hi A],[H 2 B],[G 2 C ]) > Gnmk(H 1: H 2 , G 2 ). 

(ii) Concatenating M column increases the Grank by at most M. 

Grank ([Hi A], [H 2 B], G 2 ) < GranJfe(Hi, H 2 , G 2 ) + M 
Grank( Hi, H 2 , [G 2 C]) < Grank( Hi, H 2 , G 2 ) + N. 

(in) Concatenation of linearly dependen t columns does not change the Grank. Suppose that 


colspan 


Hi 

H 2 


then 


C colspan 
colspan [C] C colspan [G] , 

Grank ([Hi A],[H 2 B],[G 2 C}) = Grank(H 1 ,H 2 ,G 2 ). 
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Statement (i) follows from submodularity of the rank function. Statements (ii) and (in) follow 
from elementary properties of the rank of a matrix and the definition of the Grank. We omit a 
proof of the lemma here. 

Next, we state a lemma that will be useful later on in generating our linear coding solutions. 

Lemma 2 . Let Hi,H2,G2, A,B be matrices with entries from a finite field IF, respectively having of 
dimensions P\ X Q \, P2 x Q] ■ If X Q'l- P\ X M and X M. Suppose that 

Grank ([Hi A],[H 2 B], G 2 ) > Grank(Hi, H 2 , G 2 ) 

Then, there exists a M X 1 column vector f such that 

Grank( [Hi Af], [H 2 Bf], G 2 ) = Grank( Hi, H 2) G 2 ) + 1 (12) 


Furthermore, if 


(i) colspan( B) C colspan([ H 2 G 2 ]) 

(ii) colspan( B) <f_ colspan( H 2 ), 


then, choosing a vector v randomly from the null space of the column space of [LL B] and setting f to be 
the last M entries ofv satisfies ( 12 1 with a probability that approaches 1 as the field size F increases. In 
this case, the vector f satisfies the following property: Bf £ col span (Ho)- If (i) ° r (**) are not satisfied, 
then picking the entries of f randomly and uniformly over the field satisfies dZ3> with a probability that 
approaches 1 as the field size increases. 


Since adding a single column can increase the Grank by at most 1, the vector f that satisfies 
( fT2| ) maximizes the Grank([Hi AfJJtL Bf],G 2 ). The vector f will be useful in obtaining the 
code construction in the recursive coding routine. The approach of choosing f randomly to satisfy 
( |12| ) follows the spirit of random linear network coding [8], 


4.2 Destination reduction 

The destination reduction algorithm takes the original problem f l = ( Q,S,T) and generates a 
sequence of N + 1 ordered two-unicast-Z network problems, for some N £ Z+, starting with the 
original problem itself. We denote the sequence of problems as P = ..., , 

where = Ll and = (Q,S,T^) with being the destination sets for the 

problem number i. In particular, all the problems have the same underlying graph Q and source 
set S, but different destination sets, i.e., f T^\i j. The algorithm is formally described 
in Algorithm [lj in which the key procedure is to sequentially generate from the previous 

problem f . We describe the process informally here. 

Recall that in a directed acyclic graph, there is a total ordering Ord on the vertices of the 
graph. Also recall that Ord induces a partial ordering on the edges, where the set of edges of the 
same topological order share a common tail node. In brief, the destination reduction algorithm 
obtains problem from fR*) as follows. We find all the highest topologically ordered edges in 

the union of the two destination sets. In each destination set, if it contains any of these edges, we 
replace them with their immediate parent edges. Specifically, given fiW = (G,S,T^), let E 
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Algorithm 1 Destination reduction algorithm 
1: procedure REDUCTION^ 0 )) 

2: P «— () 

3: add fi(°) to P 

4: i «— 0 

5: S <— {e : Tail(e) G 5} 

6: while Tp* U T 2 (i) % S do 

7: E <- |e : argmax (<) (i) Ord(e)| 

Ej <- ECiT^J = 1,2 
n «— Tail (A) 
for j -t- 1,2 do 

if Ej 7^ 0 then 

rf +1) •<— uin(u) 

else 

T"i(*+1) ,_ Ti(*) 

j ^~ j 

end if 

end for 

>y“(i+l) ^^>(®+B^ 

18: D(* +1 ) <- (a,5,T (i+1 )) 

19: add 0(* +1 ) into P 

20: % i — i 1 

21 : end while 

22: return P 

23: end procedure 
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10 

11 

12 

13 

14 

15 

16 
17 


(i) (i) 

denote the set of edges in 7} U T 2 ' with the highest topological order. In other words, all edges 
(i) (i) 

in E C Tl U T 2 ; have the same topological order, and a strictly higher topological order with 
respect to every edge in U T^XE. For j G {1,2}, let Ej = n E. For each destination 
j G {1,2}, if T- does not contain any highest topological ordered edge, i.e., if Ej = 0 , then the 
destination set remains unchanged in i.e., T-* +1 ^ = Otherwise, all edges in Ej are 


removed in T^ l> and replaced by In(u) to produce the new destination set 1 * in T^ +1 \ that is, 


0+1) 5 , 


Tf +1) = (T^\Ej) Uln(u). 


-.(Ox 


j 


Before proceeding to describing our coding scheme, we list some useful and instructive properties 
of the destination reduction algorithm; these properties can be easily checked for the example in 
Fig. [6} 


i(0\ tB*! 1 ) 


has a common tail node v, which also forms the com- 


Property (i) The set of edges Tj ; \Tj 

mon head node of all the edges in Furthermore, there are only two 

possibilities: is empty or = In(u). 


Property (ii) An edge in the graph which communicates to at least one edge in Ti appears in 


(0 


15 









Figure 5: An example network used to demonstrate our algorithm operation 
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Figure 6: The destination and recursive coding algorithms shown for the network of Fig. [5] The scalars 
P 2 ,q 2 ,P 3 ,P 4 , ( H:Po are chosen randomly and independently of each other. Note that the recursive coding 
algorithm operating at Stage 0 performs alignment step at vertex v&. 


for some value i G {0,1,2,..., N}. An edge which communicates to at least one edge 
(i) 

in T 2 appears in T 2 ' for some value of i. Consider an edge e which communicates to 
at least one edge in T\ and at least one edge in T 2 . Let k\ denote the largest number 
such that the edge e belongs to . Let k 2 denote the largest number such that 
the edge e belongs to T^ 2 \ Then k\ = k 2 . 

Property (iii) Consider a two-unicast-Z problem 0. where every edge in the graph is connected to 
at least one of the destinations, that is, there is a path from every edge to at least 
one edge in T\ UT 2 . Then, the set of all edges have a lower topological order with 
respect to U \ U in Q is equal to 

(J T[ k) UT 2 (fe) . 

Property (iv) In the the destination edges are collocated with the source edges. That is 

t|' V ) UT 2 (JV) C Si U S 2 . Furthermore, if every edge emanating from the source nodes 
communicates with at least one of the destination nodes, then T^UT^ ^ = SiUS 2 . 
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Property (v) For all i, 0 < i < N and j = 1,2, the destination set Tf forms a cut set between 
both sources si,S 2 an d the destination set T) for the original problem G. 

Properties (i) and (iv) can be verified by examining the algorithm. We prove Properties (ii), (iii) 
and (v) in Appendix |A} It is instructive to note that Properties (ii) and (iii) imply that the collection 
of sets 

| (V® U Tf\ \ (rf +1) U T 2 (i+1) ) : i = 0,1, 2 ,..., JV J 

forms a partition of the set of edges of the graph. Furthermore, this partition is the same as the 
partition implied by the topological ordering on the edges, i.e., all edges of the same topological 
order belong to one unique member of this partition. As we observe next in our description of the 
recursive coding algorithm, the recursion at depth j designs the local coding co-efficients for the 
edges in the jth member of this partition. 

4.3 Recursive coding construction 

We describe the recursive coding algorithm formally in Algorithm [2] We present an informal 
description here. Without loss of generality, we only consider networks where every source edge 
communicates with at least one edge in T± U T 2 . The first step of the recursive coding construction 
begins with a trivial coding scheme for In particular, note that Property (iv) states that 

T, (A ° U T.'f' 1 is equal to Si U S 2 . We set the local coding vector for an edge in f N> U T.f' 1 to 
be the vector with co-efficient 1 corresponding to the edge and 0 elsewhere. We assume that the 
local coding vectors for all the edges outside of t| A * U to be indeterminate at this point; the 
local coding co-efficients for these edges will be determined using the recursive coding algorithm. 
Each step of the recursion is referred to as a stage. The recursive algorithm has N stages, where at 
stage i, the algorithm generates the code for G^ using the coding scheme for the previous stage for 
. In particular, the recursive coding algorithm accomplishes the following: Given a linear 
coding scheme for the (i + l)th stage, that is, for G(* +1 ', the algorithm at the ith stage constructs 
a linear coding scheme for G^. Starting with the trivial coding scheme for G^', our algorithm 
recursively constructs coding schemes for the problems f..., G^ 1 ), which eventually 
leads to a coding scheme for the original problem G = G^ 0 ). Next we focus on the coding algorithm 
at stage i assuming a linear coding scheme for G(* +1 ) is given in the previous stage. 

A solution to the problem G^ +1 ^ will describe local coding co-efficients for all the edges in 
Ui+i<fc<Ar U lf +1) , and leave the co-efficients for the remaining edges to be indeterminate. 

Given a linear coding solution for the problem the coding solution for the problem Gw 

inherits the linear coding co-efficients from the solution to G(* +1 ) for all edges in Uj+kakat r[ i+l) u 
T^‘+ 1 \ complete the description for a solution to G®, the algorithm specifies the local coding 
co-efficients for edges in (rf Upj \ U . Because of Properties (ii) and (iii) of 

the destination reduction algorithm, we note that specifying local coding co-efficients for edge in 
(T-P U T,f ^ \ U T 2 ' +1 ^ suffices to specify the global coding-coefficients for these edges as 

well, since all the edges of which have a lower topological order with respect to this set have been 
assigned coding co-efficients in the solution to G^ +1 \ 

We use the following notation in our description. For j = 1,2, let 

uf = Tf n 7p +1) , if = Tj i+1 \uf, of = Tf\uf. (13) 
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Recall from Property (i) that all the edges in Tf\Tf~ 1 ^ have a common tail node v, which 
is also the head of all the edges in The set if is therefore contained in the set of 

incoming edges on to this node v. The set O )' is contained in the outgoing edges from v. Based 


on Property (i), we observe that if if ^ <f>, if / <j>, then If = 1f = In(u). The set Uf is the set 
of unchanged destination edges between T-* +1 ^ and Tf. We divide of into two disjoint subsets, 


r(®) 


r(*) _ rW 


r(®) : c 


i(i) 


i 2 and Bf , such that Of = Af U B 2 1 , where 


i(0 


1 (0 


,(0 


A® = of n of 


B f = of\o\ 


W\ 


(14) 


Note that Property (ii) implies that Bf contains edges that communicate with at least one edge 
in T 2 but not T\. Similarly, Af contains edges that communicate with at least one edge T\ and at 
least one edge in T 2 . It is useful to note that Af f (f> =>■ if f (f>, I 2 f (j) 


and of form a partition of the set [ff U T 2 

co-efficients for Bf and of in the recursive coding algorithm. 

Henceforth we will use the following notation. For any set of edges P C Tf, i G {1,2,}, 
the transfer matrix between source i and P is denoted as H ? p . For any set of edges Q C T 2 \i G 
{1,2}, the transfer matrix between source i and Q is denoted as G ( f Furthermore, for the sake 
of consistency, we will assume that the columns of Hf, G^ are ordered based on the topological 


"»(*) I I rp{i) 


if = if. Since B [ f 
\ (T|' ?+1 ^ U T 2 +1 f , we specify local linear coding 


orderings of the edges in P and Q. Note that with our notation, H 2 


rf nrf _ 


W 


= G 2 1 


(0 

12 . With 


this notation, the local coding vectors at node v generate H^ 1 

r(i) 


r(0 a ( a ) d(*) 

from H' 1 , and Gf and G J 2 


from Gj 2 , for j = 1,2. Therefore, the goal of the recursive coding algorithm is to design local 

coding matrices F p i 1 P 2 J a t vertex v, with dimensions |In(u )| x |0^|, and |In(u)| x \Bf \ 

respectively. The global coding co-efficients will be determined as, for j G {1,2}, 


£)W 

H? 1 = 


(0 

H? F 


TO’) r (i) ^(i) A (®) TO/ r i L) a 

-*1 ' , 0 ^ J (~*^2 _ (~^*2 fp -^2 


r(0 


,g7 2 =g;. 2 ; Gf 2 * 1 =gJ 2 f 7 2 v ’ s 2 


r(*) 


r (0 e >(0 


(15) 


^ a (i) m # r-(i) 

where note that, if Af is non-empty, then F J 2 > A 2 i§ a sub-matrix of FT ,u i . This is because 


i(») 


12 (= of and, if A 2 ’ is non-empty, If = I 2 


i(») 


r(®) _ r(*) 


Informal Description of the Recursive Coding Algorithm: A formal description is de¬ 
scribed by Algorithm [2j Here, we present an informal description of the recursive coding algorithm 
for the ith stage, assuming that stage z + 1 is complete. As previously stated, our goal is to generate 

r(z) r(i) (i) t(1) r~) (z) 

F J i ’ u i , F j 2 >^2 in (15). We do this in two phases, in the first phase, we find F J a >^2 and in the 


j(i) /^(z) ^ # ^ # ^ 

second phase, we determine F 1 . Our strategy is based on the idea of designing the coding 

co-efficients so that the Grank is maximized at each stage. 


4.3.1 Phase 1 

? (0 _ 


(*) 


If Bf = 0, proceed to the next phase. Otherwise, select each entry of the matrix F 1 ^ ^ 
uniformly at random from the underlying finite field F. We note that, over a sufficiently large field, 
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Algorithm 2 Recursive Coding Algorithm 

1: procedure RECURSIVE C0DING(P, F) > F represents the field over which the coding is performed 
Denote P by ^ (i+1) ,..., fiW). 

Denote fiW = (Q, S, , T 2 ^}) and use the notation (13),(14). 

Denote £^ = U * =k T[ j) U T 2 0) for k € {*, i + 1,..., N - 1} 


2 

3 

4 

5 

6 

7 

8 

9: 

10 : 

11 

12 

13 

14 

15 

16 

17 

18 

19: 

20 : 

21 : 


if length(P) = 1 then return I|^(iv)| 

end if 

F = 0 |£( i )|.x|£(0| 

pfl’+u,£(*+!) _ r ECURSIV e CODING((D( m ), . . . , flW)) 


> The recursion 

rfi) q(^) f , 

> In the next few steps, we will describe coding co-efficients F •? ’ o , j £ {1, 2} 


if /i^ p 0 then 

r(i) d(^) 

F 2 ’ 2 Uniformly random from the field F 
end if 

0 = A = 6 


> Phase 1 


> Temporary variables (sets) used in the for loop next 


for e e 0 , , ? * do 


> Phase 2: Encoding the edges in Oj ; one edge at a time. 
0 = 0Ue,4 = 0n A^ 1 > O and A respectively represent the subsets of (/{' and A 2 \ In the 
next few steps, we will find the coding co-efficients for the edges in O, A 


rp{i+ 1) rp(i + 1) rp(l + l) 

if Grank ( H, 1 Hp , G 2 2 ) > Grank 


Hp H? 


H f H o 


rrfi) —p d( 

^2 

t 2 ^2 ^2 


r(0 


& colspan ( ) (jL colspan 


H 




r(*) 


& colspan ( IV ) C colspan 


H 


■u i 


■(0 „ 


H 


o 


G tp /-i ^4 n B 2 

2 '- I 2 '- I 2 


(0 




(i) 


then 


22 

23 

24 

25 

26 

27 

28 

29 

30 


■Ti 


v Random vector in ker(H 2 

/•(*) j+r t . i r (i) 


■0+i) 


> Alignment Step 


F 7 i Last |/{* |rows of v 

else 

j(i) r -i 

F + ’i e i +- Uniformly at random from the field F 

end if 
end for 

return F 
end procedure 


> Randomization Step 
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our 


_ r(i) r)(i) 

choice of < B 2 


maximizes 


Grank [ H 


T K 


(i+l) 


T 

FT 1 

, n 2 


6 + 1 ) 


r(0 ,(*) R (i) 

Q Q' Z 2 ,-0 2 


T-(i) r^(i) b 

with high probability. This is because choosing the entries of FL >^2 uniformly at random 


maxi¬ 


mizes, the rank of 


H 


1 1 


6 + 1 ) 


r6) M),B y 2 

G 2 g' 2 F 7 2 


6 ) 


with a probability that tends to 1 as |F| increases. 

>(*) 


After phase 1, we have found coding vectors for edges B 2 ■ 

4.3.2 Phase 2 

Let = {e^i, e, i2 , • • •, ej )7n }, where m = |0^|. We design the coding co-efficients for the m edges 

(z) 

in 0\' one-by-one, with the co-efficients designed to maximize the Grank at each step. Our choice 
of coding co-efficients is motivated by Lemma [2} In particular, the second phase is divided into q 
steps, where in the jth step, we design the coding co-efficients for edge ei t j. Each step is classified 
as a alignment step or a randomization step as follows. 

Let 0\l = 0 and = {ejp,..., e^} be the subset of the first j elements of for 1 < j < m. 

Let 45 = 4° nog- 

Case I (Alignment): The following conditions are satisfied. 


^(i + l) rT-i(*+l) rp{i + 1) 

Grank (HI 1 ,ui 1 ,G 2 


I 

colspan Ha 1 


6 ) 


colspan H 


r 7 i 


(0 


> Grank 

(£_ colspan 
C colspan 


H 

H 

H 




(0 


o (i) ' 

H, J * 


H 




6 ) 


<p6) 

H 1 ’^ 1 

1 L 


Gy 2 


(0 


77 W 

U 1 

2 


u (*) 

u i 

2 

0 M 

H m - 1 

1 '2 


A d) 

cf 2 "- 1 


(16) 

(17) 


j/6) 

G 2 2 


A 6 ) 

g 2 2j_1 


g6) 

G 2 2 


(18) 

n(»+l) 


In this case, we choose a vector v randomly in the nullspace of the column space spanned by H 2 1 

r(i) r 1 | (A | 

and choose the column vector F 1 ’t e u/ to be the last I) | rows of co 
sets are denoted by temporary variables O, A in Algorithm 

Case II (Randomization): Otherwise, that is, at least one ofT 

r(i) r \ 

is violated. In this case, we select the vector F 1 by choosing each of its entries uniformly at 
random from F. 


umn vector v. Note that the 
2 

re conditions 


If (16)-(18) are satisfied, then we refer to our coding step as the alignment step. If at least one 


Phase 2, the coding co-efficients for 


of (|T6|)-(fT8|) is violated, we refer to the coding step as a randomization step. After the q steps of 

\J lf +1) 


U Tr 


6 + 1 ) 


i<k<N 


are determined. This completes the description of the recursive coding algorithm. Before proceed¬ 
ing, we discuss some properties of our algorithm. 


Gf 2 
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Discussion 


We note that if (16) is satisfied, then, for a sufficiently large field size, we can show using Lemma 
[2] that 


Grank 
= Grank 


[/M 

Hj 1 


H 




H '> 

11 2 


(0 


H 


Ul 


(i) 


o (i) 

H? 1,2 " 1 


H, 


oW.l 

FT 1,2 

n 2 

1 

U 6) 

G 2 2 

A (i) 

G 2 2,2 

f >(®) 

gJ 2 

/■/■(*) o w 

r u i xx !.J-1 

2 n 2 


r/W 

g 2 2 

A (i) 

g 2 2 "- 


d(0 

G? 2 


+ 1 


with a probability that approaches 1 if the size of the field F grows arbitrarily. We show this below 
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where (a) follows from Statement (ra) of Lemma [lj (6) follows from (16) and (c) follows from 
Lemma [2j 

It is also instructive to note that, if p > q, 


nr(p) rji(p) r-n{p) '■r’CO rpiq) rp(q) 

Gra nk in; 1 , H 2 , G 2 2 ) > Grank(H^ ,H 2 2 , G 2 2 ( 


That is, the Grank of a stage q is no smaller than stage p which is downstream with respect to a 
stage q. 

We note that our algorithm performs better than optimal routing (multi-commodity flow) for 
certain networks, for e.g., Fig. [5} On the other hand, for certain networks our simulations have 
revealed that routing outperforms our algorithm. 


Complexity 

First consider the destination reduction algorithm. Since each iteration corresponds to one vertex, 
i.e., the common tail node of the last topologically ordered edges, the algorithm terminates in 
0(|V|) steps. 
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Now consider the recursive coding algorithm. The algorithm traverses through all the inter¬ 
mediate nodes in their topological order and performs either random coding or alignment step for 
outgoing edges. At each node v, the complexity of the coding operations is dominated by the 
complexity of the alignment step, if it is performed, which is bounded by 0(d 3 ), where d is the 
in-degree of node v. Therefore, we have the following property for the recursive coding algorithm. 

Lemma 3. For the class of directed acyclic graphs with in-degree bounded by D, the complexity of the 
recursive coding algorithm is 0(\V\D 3 ). 


5 Alternative proof of the max-flow min-cut theorem 

In this section, we give an alternate proof of the max-flow min-cut theorem. The intuition of our 
proof is provided in Section [2} Our proof is based on induction on the number of edges of the 
graph. We begin with some notation that we will use in this section. 

We denote the single unicast network graph as Q„ = (V n ,£ n ) with n edges the subscript n is 
a notation that is useful in our proof. As per the notation introduced in Section [3j we denote 
the unicast problem as £l n = ( Q n ,s,T n ), where s G V n is the source and T n c £ n is the set of 
destination edges. A linear coding solution for £l n can be described by the local coding matrix 
F n , which results in a linear network transfer M n = (I n — F n ) _1 . The source to destination linear 
transfer matrix H n can be obtained as the submatrix of M„, whose rows correspond to the source 
edges and whose columns correspond to the destination edges. Mathematically, it can be done by 
multiplying M„ with with an incident matrix A n of size S x n and then an exit matrix B n of size 
D x n, where S is out-degree of the source node and D is the cardinality of the destination edge set. 
Each row of A n is a length n unit vector indicating the corresponding source edge coming out of the 
source. Likewise, each row B„ is a length n unit vector indicating the index of the corresponding 
destination edge0 To the end, the (s — T n ) source to destination linear transfer matrix is given 
by H n = A n (I n - F n ) -1 Bj t . Our goal is to show that the rank of H n can be made equal to the 
min-cut between the source and the destination by choosing F n appropriately. We use the following 
notation for the min-cut: for any problem 0 = (Q, s, T), we denote the min-cut between the source 
node s and the destination edges T as cg(s,T). 

Without loss of generality, we assume A n • B„ = 0, that is, the source node s is not a tail 
node of any destination edges in T in the graph Q n . If there is any destination edge that is coming 
directly from the source node, then this edge can be removed; it suffices to show the max-ffow 
min-cut theorem can be proved on the remaining graph. Next we introduce a few definitions and 
lemmas which will be useful in our proof. 

5.1 Preliminary Lemmas 

Definition 2 (Atomic matrix). An atomic matrix of size n x n is an upper-triangular matrix, where all the 
off-diagonal elements are zero, except those elements in a single column. Given an upper-triangular matrix 
U, the i-th atomic matrix of U, denoted as TjW, is the atomic matrix formed by setting all the off-diagonal 
elements of U to zero, except those in column i. 

’Note the difference the incident/exit matrices and the input/output matrices A and B defined in | 2 j. The latter are not restricted 
to unit vector in rows and encompass the encoding and decoding operations at the source and receiver respectively. Here, we focus 
on the transfer matrix observed for the network. Hence, we are concerned with only incident and exit matrices. 
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A standard property in matrix algebra captures the relation between an upper-triangular matrix 
and its atomic matrices. 


Lemma 4 (Atomic decomposition). An n x n triangular matrix U with all one diagonal elements can be 
written as the product of its atomic matrices in the reverse index order, i.e. U = • U -b.-.uw. 

Since the local coding matrix F n is an n X n strict upper-triangular matrix, the quantity 
(I n — F n ) is also an upper-triangular matrix but with all diagonal elements begin equal to 1. We 
are interested in decomposing the inverse of atomic matrix (I n — F n ) ^ in order to understand the 
linear network transfer matrix. For that, we start with the following property. 

Property 1. 

((I„-F nfy 1 = (I n + F n )W . (19) 


For simplicity, we denote E n = (I„ + F n ) and define z-th atomic matrix E„ to be the z-th edge 
coding matrix. Note that the z-th column entries of E„ above the diagonal represent exactly the 
local coding vector of the z-th edge of the network. With property we note the following. 


-i 


Lemma 5 (Network transfer matrix decomposition). The network transfer matrix (I„ — F n) 
decomposed into the product of its edge matrices in the forward topological order, i.e. (I„ — F 

ni.E !: 1 


can be 
\—l 


The proof is simple and omitted. The above lemma is interesting because it decomposes the 
network transfer matrix into the local coding based into a prodoct of n matrices, where each matrix 
in the product captures the influence of the local coding vector corresponding to a single edge. The 
edge reduction lemma of |45| can be shown simply using the above decomposition. We now proceed 
to a proof of the max flow min-cut theorem. 


5.2 The proof 

Now we are ready to prove the max flow min cut theorem. It is sufficient to show the linear 
achievability of a flow that equals to the min cut. That is equivalent to the following proposition. 

Proposition 1 (Achievability of Min-Cut). For all k E ZA, given an arbitrary directed acyclic graph 
Qi, = (V, £) with k edges, and a unicast problem Q/ ; = (0/,-, s, T), there exists a k x k local coding matrix 
F such that the rank of the transfer matrix from s to T equals to the min cut between s and T, i.e. 

rank (H) = rank [a, (I fc - F/,) -1 B 7 ) = cg k (s, T) . (20) 

\l\ 

Note that this is equivalent of showing that there exists an edge coding matrices E), for the 
z-th edge, such that 

rank (H) = rank (A (l fc - F^ 1 ) B T ) = rank ^ A • f[ E® • B T ^j = c Sfc (s, T) . (21) 

We prove Proposition [I] using mathematical induction on k E Z + . When k = 1, the claim is 
trivially true. For the inductive step, we start with the following inductive assumption and claim 
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Assumption 1 (Inductive assumption). Proposition^Tjholds for all k such that 1 < k < n — 1. 

Using the above assumption, we prove Proposition [l] for the case where k = n, for an arbitrary, 
unicast problem £l n . It will be convenient to denote £ n = £,T n = T, H n = H, A n = A, B n = B. 
Let e n be an edge with the highest topological order in graph Q n . Let G n -i = (V,£ — {e n }). In 
the problem fi n , if e n is not a destination edge, i.e. e n 0 T n . then e n does not communicates 
with any destination edges, i.e. e n e, Ve 6 T, since e n has the highest topological order 
in Q n . Consequently, erasing e n do not affect the capacity and the min cut from s to T n , i.e. 
cg n (s,T n ) = cg n _ 1 (s,T n ). 

Next we focus on the case when e n E T n . Note that e n is not emitted directly from the source 
node s and is a destination edge. As a result, we can decompose the incident and exit matrices as 
follows, 


A n — [Ati_i Oj , 


B 


n 


b;_! o 

0 1 


( 22 ) 


where A n _i is the incident matrix from the source to subgraph Q n - 1 , while B*_ 1 is the exit matrix 
from Q n -1 to the first D — l destination edges. The zero column following A n _i indicates that e n is 
not emitted from the source node, whereas the unit vector in the last column of B n indicates that 
e n is a destination edge. Subsequently, consider the decomposition of the s — T n transfer matrix on 
Q n . In the following sequence of equations, we use the notation e n to denote the first n — 1 entries 
of the nth column of matrix F„; note that the last entry of the nth column is 0. We can write the 
transfer matrix as, 


H n = A n (In ~ F n) — 1 B^ = A n J] e{? • B^ 

n 

= [v 1 o]-f[v? 


2=1 
/n —1 


2=1 

'B ;_r 0 
0 1 


[An-! 0] • n Ef • E 


^ 2=1 


B*_i 0 

0 1 


-i T 


— [A n _r 0] 

— [A n _i 0] 


(In—1 — F n i) 


-1 


0 

(In—1 ~ F n i) 


0 


-1 


0 


= [A n _! (I n -1 - Fn-!)- 1 0] iJ ”- 1 ^ 


1 


B*L C, 



[B*_j 

0 

T 

n 


0 

l 


In— 1 



B*_! 01 

0 

l 


0 !. 


T 


— A n _i (l n —\ F n _i) 


-l 


B n _l e n 


(23) 


(24) 


In this process, for step (o), we apply Lemma [5] on the product of first n — 1 to obtain the network 
transfer matrix of the subgraph Q n -\, while in ( b ) we simply write out the matrix E n explicitly 
using the local coding vector e n on the edge e n and idenity matrix of (n — 1) x (n — 1). To the 
end, we decompose the source-destination transfer matrix into two parts that can be intuitively 
understood. The first part, H n _i = A n (I n -i — F n _i) B*_ x is simply the source to destination 
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transfer matrix of the subgraph Q n -\. The second part is the contribution of the last topologically 
ordered edge e n . Therefore, 

H„ = [H„_i A n _i (I n _i — F n _i) 1 e n ] . (25) 

Now consider the last edge e n . If it is not a part of any s — T n min cut on graph Q n , then 
removal of e n does not affect the min cut of the graph, i.e. cg n _ 1 (s,t) = cg n (s,t). In this case, 
leveraging the inductive assumption on the subgraph Q n -\, we claim that there exists local coding 
matrix F*_ x for the graph Q n -\ such that, 

rank (H n _i) = rank (A n _i (I n _i - F n _i) _1 B*^) = cg n _ 1 (s,t ) = cg n (s,t) . (26) 


To put it simply, when e n is not a part of any min cut set, we can ignore the this edge and achieve 
a flow equal to the min cut, using only the remaining subgraph Q n -\- To do this, we can simply 
choose the existing local coding matrix F*_j which satisfies ( |26| > and set the local coding vector at 
e n to be zero, i.e. e n = 0. Doing this eliminates the last column in (251 and guarantees that 

rank (H n ) = rank(H, t _i) = cg^^s.Tn) = cg n {s,T n ) (27) 


It remains to prove the claim in the case when e n belongs to some min-cut set for the graph 
Q n . Let v = Tail(e n ) be the tail node of theg edge e n . We examine two unicast problems on G n - i, 
fin -1 = [Q n -i,s,T n - 1 ) and D*_ 1 = (Gn-i, s, T*_ x ), where T n _i and T*_ x are given by, 

T n ~ i = T n \{e n } U In(u) , T*_ x = T n \{e n } (28) 


Note that both D n _i and D*_j are unicast problems whose underlying graph has exactly n — 1 
edges. For the source-destination min cuts, we have the following lemma. 

Lemma 6. When e n is a edge in some min cut between s and T n on G n , the min cut values on Q ri and G n -\ 
satisfy the following 


ce„_i (s, T n -!) > cg n (s, T n ) (29) 

CQ n -i (s, T*_ i) = cg n (. s , T n ) - 1 (30) 


The proof follows from elementary graph theoretic arguments and is omitted. It is straightfor¬ 
ward to see that the s — T n _i transfer matrix for is exactly given by H n _i, which is the first 
part of the matrix H n . For D n _i, the destination edge set is formed by replacing e n with its parens 
edges, i.e. incoming edges to v. Let these edge be e*,,..., , where are the edge indices, 

the exit matrix of D n _i is given by 


B 


T 

n—1 


^ 1 1 

■ 

B n-1 u O • 

• W h 


(31) 


where Uj ., 1 < j < h is a length n unit vector with 1 at position ij and zero elsewhere. Hence, the 
transfer matrix for H n _i is, 


H n _i — A n _i (I n _i — F n _i) 1 B^’_ 1 

~ [Hn—l A n _i (I n _i — F n _i) [uq ... u ?. h ]] • 
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Next, we invoke the inductive assumptions on the problems fl n _i and 12* _ x . We can conclude 
that, there exists a local coding matrix F n _i for f2 n _i and a local coding matrix F*_ x for 
such that, 

rank (H„_i) = rank (A n _i (I re _i - F n _i) _1 B„_i) = cg n _^{s,T n - 1 ) > cg n (s,T n ) (32) 

rank (H*^) = rank (a„_i (l n _i - F *.^ -1 = cg n _ 1 {s,T^_ 1 ) = cg n (s,T n ) - 1 (33) 

Finally, the following lemma helps us to find the local coding matrix for the original graph Q n 


Lemma 7. Given a local coding matrix F n _i that satisfying ( |32[ ) and a local coding matrix F*_ 1 satisfy¬ 
ing (|33[), for a sufficiently large field F, there exists p, g E F and a local coding vector e n such that 


rank 



(pFn-r + gF*^)) 




(34) 


The proof follows from the spirit of the algebraic framework of network coding introduced in 12 
and is presented in Appendix [Bj Therefore, from the two local coding matrices F n and F*_ l5 we 
can choose the local coding vectors for the last k edges randomly, and construct the local coding 
matrix F n for Q n as follows, 


F n = 


pF n _i + gF;_ x 

0 


0 


(35) 


The new local coding matrix F n will satisfy the original claim in (20 1 which provides an achievable 
flow that equals the min cut on any arbitray graph of n edges. 


Remark 4. Note that in the process of searching for the local coding matrix F n which satisfies (20 > 


and achieves the maximum flow, we essentially rely on recursive construction the local coding matrices 


through! the sequence {F„_i, F n _ 2 ,.... F i } using (35 I. These local coding matrices in turn correspond 
to maximum flow achieving coding matrices for smaller problems we constructed as subproblems, i.e. 

( l n - 2 , ■ ■ ■ - l i }■ This sequence of problems would, in fact, result in running the destination reduction 
algorithm of Sectioned n the original problem bl (with the second destination set equal to the null set). 


6 Achievability of Rate Pair (1,1) 

We prove for the special case that the rate pair (1,1) is always achievable through our algorithm 
unless there is a single edge GNS cut. We begin by stating our result formally. 

Theorem 1. Consider a two-unicast-Zproblem 11 = (Q, S, T) where, for i E {1,2}, source Si is connected 
to at least one edge in destination Tp and the GNS-cut set bound is at least 2. Then, the coding matrix F 
returned by RECURSIVECODING (REDUCTION (bl) , F) achieves the rate pair (1,1) with a probability that 
tends to 1 as the field size F increases. 

We first state the a few lemmas on the transfer matrices produced by the recursive coding 
algorithm. They are useful for the proof of Theorem [lj 

Lemma 8. Let Hi, H 2 and G 2 be matrices of dimensions P\ x Q 1 , Po x Q 1 and Pi x Q\ respecitvely. If 
Hi f 0 and G 2 f 1 0, then Grank (Hi, H 2 , G 2 ) = 1 if and only if the following holds, 

^ = rank (G 2 ) = rank ([Ho G 2 ]) = 1. 


rank (H 2 ) = rank 


Hi 

H 2 
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Lemma 9. In the recursive coding algorithm, for any i <G {0,1,2,..., IV}, the matrix 


H 


n 




T 


(i) 


does not 


contain an all zeroes column with a probability that tends to 1 as the size of field |F| increases. 

Lemma 10. Let (fR = jyfl \T^ \ Given that the recursive algorithm performs the alignment step from 

some stage k+lto stage k, then, with a probability that tends to 1 as the field size |F| increases, G ( f > f 0 
for every stage i in { 0 , 1 , 2 ..., k}. 

It is straightforward to verify Lemma [8] from the definition of Grank. The proofs of Lemmas [9] 
and 10 are provided in Appendix [E] and [D] respectively. 


Remark 5. Henceforth, all the statements of the proof hold true in a probabilistic sense. That is, the 
statements are true with a probability that tend to 1 as the field size |F| tends to infinity. To avoid laborious 
notation and wording, we omit mentioning this explicitly in our proof. 


6.1 Proof Overview 


We first provide a brief overview of the proof of Theorem [l] and summarize the basic ideas of the 
proof. Note that we only prove the necessity part of the theorem in this paper, as the sufficiency 
follows directly from the GNS outer bound 126 . In particular, it suffices to show that the recursive 
coding algorithm will generate source-to-destination transfer matrices which give an achievable rate 
region that contains the point (1,1). In the proof, we show that this is true for an arbitrary two- 
unicast-Z problem Tl = (Q, S, T ) with underlying graph Q. We shall use the achievable region given 
in Theorem 1 in 145], which is simplified to the following in the case of two-unicast-Z networks, 


'T'(O) 

R\ < rank ( H 1 1 

rp( 0 ) 

i ?2 < rank ( G 2 2 


/ rp(0) rp{G) 

Ri + R 2 < Grank H} 1 , H 2 X , G 


p(0) 




(36) 

(37) 

(38) 


In order to show that the above achievable region contains the point (1,1), we prove three claims 
on the transfer matrices, each corresponds to an upper bound in ( |36| ) to ( |38| ). Recall that the 
algorithm is divided into stages from N to 0. At stage i, the algorithm specifies coding co-efficients 
for the edges in the destination sets of fiW, specifically, for edges in U T 2 *\ U T 2 * +1 ^, 

by using the coding co-efficients in the previous stage i + 1. Also, recall that in the process of 

(z) (i) (i) 

coding for stage i, for destination T- , U- denotes the unchanged edges; I-' denotes the incoming 

(i) 

edges which are coded in stage i + 1 and are removed from the destination set; O ■ denotes the 

(i) 

outgoing edges which are to be coded from If edges and enter the destination set. Furthermore, 

(*) ■ (i) a (i) 

0 2 is the union of two disjoint set L> 2 and A 2 ; . The former set of edges are coded in phase 1 o: 
the algorithm while the latter edges are coded in phase 2 as a subset of To prove Theorem [l 
we show that 


rank 



> 1 , 


rank 



> 1 , 


( 7 ■>(*) t ">(0 

Hi 1 > H 2 



>2 , 


(39) 
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for every stage i, 0 < i < N, unless there is a single edge GNS cut in the graph. Note that the 
above statement is trivially true for the initial stage, i.e. i = N. Therefore, to prove the theorem, 
we assume that © holds true for some stage i + 1 and we show that the conditions remain true 
in stage i unless there is a single edge GNS cut set. We outline our proof steps below. 


In Claims 


and 


't(mI) 

that H^ 1 ,G 2 


n(i) 


we show that the transfer matrices H 1 1 


rp(i) 

and G 2 2 are non-zero, assuming 


2 are non-zero. We prove this claim by considering the the cases of ran- 


rp(i+l) 

domization and alignment steps separately. Since in the previous stage, the matrices H 1 1 

rp (*+ 1 ) 

and G 2 2 are non-zero by assumption, and since random linear combination of a set of 
non-zero vector does not result in a zero vector, a randomization step preserves the desired 
non-zero property of the respective matrices at stage i. In case that the stage contains an 
alignment step, we make use of the fact that conditions ©-© have to be fulfilled and the 
generated column has to satsify Lemma [2] We verify that these conditions imply that neither 

rp(i) 

G 2 2 nor H 1 1 are set to 0. 


We show in Claim[3]that in stage i, the Grank of the transfer matrices is always lower bounded 
by 2 only if there does not exist a single edge whose removal disconnects (si, Xi), (s 2 , T 2 ) and 
(s 2 ,Ti). Specifically, we assume that the claim holds for stage i + 1 and we show that it 
remains true for the next stage i. To do that, we first show that if any alignment step takes 


place between the two stages, the rank of the resulting matrix 


the rank of H^ 1 


n(i) 


H? 


(*)' 


T 

H? 


(i) 


is strictly greater than 


H 


(*) (0 


. This implies that the Grank is strictly larger than the rank of 
which is at least 1. As a consequence, we show that the Grank is at least 2. 

Subsequently, if the algorithm results in Grank less than 2, it can only carry out randomization 
steps at that stage. In this case, we use Lemmas I HI and [lO] to show that if the Grank at 
stage i reduces to 1, then (a) the edges in cannot be communicated from source S 2 ( b ) 


the sets u\'^ and B^ 1 are empty, and (c) there can be only one newly coded edge, common 


T 


>(*) 


to both destination set, i.e. 0\ 


(0 


A, 


(0 


= 1. (a), (6), and (c) imply that T 2 contains only 

one edge that S 2 communicates with, while T { ' contains this particular edge as the only edge 
in the set. Consequently, applying Property (v) in Section Ul we know that oj*' 1 is a cut set 
between source si,S 2 and T\. Furthermore, because of (a) and the fact that T 2 is a cut set 
between source si, S 2 and T 2 , we establish that = T 2 *\[/ 2 ^ is a cut set between S 2 and 
T 2 . As a result, we infer that the removal of the edge in 0\ simultaneously disconnects the 
source-destination pairs (si,Ti), ( 52 ,^ 2 ) and (s 2 ,LT). In other words, we infer that the edge 
in 0\ forms a GNS-cut set to complete the proof. 


We now formally present proofs of Claims [Tp] and [3| which combine to 


serve as a 


proof of Theorem [T} 


6.2 Proof of Theorem [T] 

T «) 

Claim 1. For all 0 < i < N, H 1 1 7 ^ 0. 
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Proof. Consider first the case of i = N. Since si communicates with t\, there exists some source edge 
(si,ni) G T \ A f . Recall that in the trivial solution for we simply set the column corresponding the 

each edge in T \ A * to a unit vector indicating the source edge it represents. Since (si, v \) G t[ N%) is a source 

rji(N) 

edge at si, it gives rise to a non-zero column in H, 1 . Thus, the claim holds for stage N. 


d+l) 


T v 

Suppose the claim of H 1 1 f 0 is true at some stage i + 1. consider the stage i. Note that only phase 

(j) (ij 

' ] f 0, since f/[C T^ \ the claim holds for H^ 1 . Otherwise, we 


r(0 


2 of the algorithm affects H, . I f H, 


rM 


must have H 1 1 


f 0. In this case, consider some step j in phase 2. 


(0 


T y 

If it is a randomization step, then H 1 1 f 0. 


If it is an alignment step, by Lemma[2j the newly generated column must satisfy, 


r(») r(i) r 1 

H 2 1 F J 1 g colspan 


rW 

Hi 1 

r( l ) 

Ho 1 


F 7 i colspan 


{jM 

H 2 1 

of 

h 2 1j - 

-1 
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of. 

H/' 2 

l yW 

K 1 

of 
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r(^) 


J K / j{i) r -i J j^l) r -i 

This is only true when H 1 1 FT d e i,jf f o. By the construction of the algorithm, Hy FT d e i,jl i s 


(0 


r(<) 


n(i) 


a column in FI^ 1 


n (0 


Therefore, H[ 1 f 0. This completes the proof. 


□ 


n(i) 


Claim 2. For all 0 < i < N, G 2 2 f 0. 


Proof. For i = N, since S 2 communicates with t- 2 , there exists a source edge ( 52 , vf) G T 2 N \ Similar to 
the previous case, in the trivial solution for f f N \ we simply set the column corresponding the each edge in 


fN) 


l 9 to a unit vector indicating the source edge it represents. Since (s 2 , vf) G T 2 i s a source edge at s 2 > 

n(N) 


it gives rise to a non-zero column in G 2 2 . Thus, the claim holds for stage N. 


r(0 


Assume the claim is true for stage i + 1 and consider stage i. If G 2 2 =f 0, then the claim holds 


n(*) 


for G 2 2 , as U 2 W C T 2 


(0 


rW 


f 0. In this case, if random linear coding is 
performed, either in phase 1 (for Iff edges) or for an edge in ff in phase 2, then the resulting column is 


Otherwise, we must have G 9 2 

,(0 


n«) 


non-zero and hence G 2 2 


/0. 
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Randomization steps do not violate Claim 3. 
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As a result, we have the following at the beginning of recursive algorithm Phase 2 between stage i + 1 
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This completes the proof. 


□ 


7 Conclusion 


The techniques of routing and random network coding have served as pillars of our encoding func¬ 
tion design in networks. These techniques are loosely analogous to wireless network achievability 
techniques of orthogonalization and random coding combined with treating interference as noise 
respectively. The paradigms of orthogonalization and random coding were challenged by interfer¬ 
ence alignment in |28j. An important milestone in the development of interference alignment for 
wireless networks was the development of numerical alignment algorithms 151 52 . In this paper, 


we have undertaken an analogous effort for alignment in wireline network coding. 

Our paper leads to several open problems. We initiate the study of two-unicast-Z networks. 
For two-unicast networks, we know that linear coding is insufficient for capacity in general, and 
that edge-cut outerbounds 26, 53 are loose. In contrast, it is an open question whether even 


scalar linear network is sufficient for two-unicast-Z networks; similarly, it is not known whether the 
GNS-cut set bound is loose for two-unicast-Z networks. Second, our approach to maximizing the 
sum-rate is rather myopic, since we greedily optimize the network coding co-efficients one edge at a 
time. In comparison, linear programming based algorithms have been formulated for routing, and 
for network coding restricted to binary field. Development of similar formulations for optimizing 
the sum-rate of the two-unicast-Z network is a promising research direction. Finally, an interesting 
question is how our approach compares to other approaches when translated to the index coding 
problem through the construction of |37 ; . 
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A Proof of Properties (ii), (iii) and (v) 

We first state and prove Lemma m which describes a useful property of the destination reduction 
algorithm. Then we prove Properties (ii) and (iii). 

Lemma 11. For any two positive integers p, q with p < q, the set of all edges in Tf 1 ' U T^ 1 ’ has a strictly 
lower topological order with respect to the edges in (rf p) U T 2 P ^ \ ^T^ p+1 ^ U T^ +1 ^ . 

Proof. For any integer m, let us denote by e ' m) , an edge of the highest topological order in set T\ '"' l U T,f m ' 1 . 

From the destination reduction algorithm, we note that for any integer m, every edge in frf rn ' > U T 2 m ^ \ (rf 171 1 l ' 1 U T'f" 1} 
has the same topological order as Furthermore, from the algorithm, we note that e- m) has a strictly 

(TTL I 1 ^ (7TI I 11 / \ 

higher topological order with respect to every edge in T r J u r 2 — ; specifically, e (m) has a higher 
topological order with respect to e^ m+1 \ This is because, in the process of obtaining of m+1 ) from (>J m ( 
we remove the edges with the highest topological order from the two destination sets T, U T 2 and re¬ 
place them with the immediate parent edges, which have a strictly lower topological order. By the transitive 
nature of partial ordering, we infer that e (j>l has a higher topological order with respect to e (q> . Therefore 
(f'F has a higher topological order with respect to every edge in T'f 1 ’ U T 2 ^. Furthermore, since every edge 
in (t^ U T 2 p ^ \ u T 2 (p+1) ) has the same topological order as e^ p \ we conclude that every edge 

in this set has a strictly higher topological order with respect to every edge in T[ q) \JT^ q) . □ 

Proof of Property (ii): For j £ {1, 2}, let Kj be the set of edges that communicate with destina¬ 
tion Tj, but do not belong to Uo<fc<iV Tj k K We show that K\ and are empty by contradiction. 

Suppose that K\ is non-empty. Let e be the highest topologically ordered edge in K\. Since there 
is a path from e to destination 1, there is at least on edge el which is a member of Out(Head(e)), 
such that there is a path from e' to destination 1. Furthermore, since e! does not belong to K \, 
there exists an integer k such that e lies in T( U T.) ' for some value of k. Let k* denote the largest 
among such integers. Therefore, we note that e' does not lie in T^ k +1 ^ U T 2 fc +1 \ By examining 
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the destination reduction algorithm, we infer that e! is among the edges of the highest topological 
order in T^ k ^ U T^ k \ In the process of obtaining n( fc ' +1 ) from all the edges in In(Tail(e')) 

are added to T^' +1 ^ or T,^' or both sets, depending on whether e! belongs to T k * , or T^ k ^ or 
both sets. In particular, edge e is added to T± k +1 ^ U T^ k +1 \ Therefore edge e does not lie in I\\ 
which contradicts our earlier assumption. Therefore the set K\ is empty. We can similarly show 
that K 2 is empty. 

Therefore, we have shown that if an edge communicates with destination j for j E {1.2}, then 
it belongs to Tj for some integer k. Let k\ be the largest integer such that e belongs to T) . 

Let A >2 be the largest integer such that e belongs to T^ 2 \ To complete the proof, we show that 
k\ = k‘ 2 - As a contradiction, suppose that k\ ^ k/ 2 . Without loss of generality, assume that k\ > • 

Then, we note that edge e belongs to T^ 2 \ but does not belong to T^ 2+l \ By examination of 
the destination reduction algoritm, we infer that this means that edge e is among the highest 
topologically ordered edges in T^ 2 \. and that edge e has a higher topological order with respect 


(fci + 1 ) . . ^(^2 + 1 ) 


to all edges in T[ } U Tr 


Since k\ > k 2 and e E U T^ 2+1 ^ \ ^T-j k ^ U 


, Lemma 

(ki) 

strictly higher topological order with respect to all the edges in T) ' U Tr 
belong to T [ k '• l contradicting our earlier assumption. Therefore k\ = k 2 - 


implies that e has a 
. Therefore e cannot 


Proof of Property (in): Let represent the set of all ec 


with respect to 


->(*) 


->(*) 


\ [T, 


(*+l) 


U Tr 




From Lemma 


11 


with a lower topological order 
we infer that 


|J T[ k) UT 2 (fe) C K®. 


i-\-l<k<N 


To complete the proof, we need to show the reverse, that is, we need to show that 


K® C J 

i+l<k<N 


T (k) u T (k ). 


Consider any edge e in K^\ Since e communicates with one of the two destinations, Property 
(ii) implies that e lies in U T,^' 1 for some integer m. Let m denote the largest integer such 
that e lies in T^ m ' 1 U T.'f 1 ' 1 . To complete the proof, it suffices to show that m > i + 1. We show this 


by contradiction. Suppose that m < i. Because e lies in ( U ) \ ( W 1 


(m+l) ^ rp{m+ 1) 


we 


§ (i) (i) 

that e has a higher topological order with respect to every edge in T) UT 2 W . 

This contradicts the assumption that e lies in K^\ Therefore, we have m > i. If m = i, then e 


\ ( t[ 1+ 1 ^ UT 2 (,+1) ) . This also contradicts the assumption that e lies in 


lies in UL 2 ( ^ x ^ 2 

Therefore we have m > i. This completes the proof. 

Proof of Property (v): We show that each set is a cut set between si, S 2 and destination 

(z) 

edge set T\. The same argument applies to the case of T) . Note that it suffices to show that 

(i) 

any path between si or s 2 and some edge in Ti passes through at least one edge in each set T, , 
i = 0,1,..., IV. Specifically, consider an arbitrary path P with length to, denoted by a sequence 
of edges, i.e. P = {ei, 62 ,..., e n }, where Head(ej) = Tail(ej+i) for 1 < 1 < to. We show that if 
Tail(ei) = si or Tail(ei) = S 2 and e n E Tf, then T(' ! nP/0 for all i = 1, 2,..., N. 
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First consider the case i = 0, since e n £ Ti and T,' 0) = Ti, clearly T^nP 7 ^ 0. To complete the 
proof, we assume that for some k, 0 < k < N, T\{k ) flP /0 holds and show that flP/ 0 . 

Let ej be some edge belonging to both P and T^\ If ej £ x[ k+1 \ then clearly, T 1 ^ fc+1 ' ) n P is 
non-empty. Suppose otherwise, i.e. ej 0 j'P+ 1 )^ then ej £ By the construction of the 

destination reduction algorithm, we know that ej is not a source edge and In(Tail(ej)) C p^ k+1 \ 
As a result, ej / ei and ej_i £ In(Tail(ej)). We have e^-i £ P and e^-i £ T^ fc+1 \ that is, we have 
shown that T-( fc+1 ) flP/ 0 . This completes the proof. 

B Proof of Lemma 0 

For convenience of notation, let cg n (s,T n ) = M. Since, 

rank(H ri _i) = rank (A n -i (I„_i - F n -i ) _1 = M - 1 (53) 

rank(H*_ 1 ) = rank (a„_i (l n -i - F *^)' 1 > M , (54) 

there exists an (M — 1) x (M — 1) submatrix Si in H„_i whose determinant f\ is not zero. Similarly 
in matrix H*_ 1 , there exists some M x M submatrix S 2 , whose determinant /2 is not zero. 

Next consider the submatrix Si in A„_i (I„_i — pF„_i — r/F*_ 1 ) 1 B*^ as well as the subma¬ 
trix S 2 in A n _i (I n _i — pF n _i - gF *^) -1 B^_ v Let their determinants be f\ (p. q) and f 2 (p,q)- 
Clearly fi{p,q) ^ 0 and f 2 {p,q ) / 0 since /i(l,0) ^ 0 and / 2 (0,1) / 0. Thus, f(p,q ) = 
fi(p, Q)f 2 {Pi q) is a non-zero polynomial. Applying Lemma 4 in [ 8 ] to f{p,q), we conclude that 
if the underlying field F is large enough, choosing p, q uniformly at random from F will yield 
fi.Pi q) 7 ^ 0 with a probability that tends to 1 as the field size increases. Equivalently, with the 
random choices of p and q, 

rank (A„_i (l n -i -pF„_i - gF*^) 1 B^_^ = M - 1 (55) 

rank (Vi (L-i - pF n -i ~ qK-i)^ > M . (56) 

Furthermore, since B *^. 1 = [ B^_ x uq | ... | u i h ], we conclude that the matrix 

A n -1 (in —1 — pFn—l ~ qF n _i) [ Uq | • • • | ] , 

which are the symbols carried by the edges eq to ei h , contains at least 1 columns that are linearly 
independent from the columns of A n _i (l n _i — pF n _i — gF*^) 1 Because e lx ,... , are 

the parent edges of e n _i_|_i,..., e n , by choosing the local coding vectors for the last edge uniformly 
at random from F, we are setting the columns of 

A n _i (in —1 pFn—i ^F ra _ 1 ) [fin— 1+1 ... e n ] 

to be a random linear combinations of the columns of 

A n —1 (in—1 pFn— 1 (/F n _i) ( Uq . . . | Uj h j . 

Therefore, 

A n —1 (in —1 pF n —1 ^Fn_l) [® n ] 
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contains exactly one linearly independent column from A n _i (l n _i — pF n ~ \ — gF*_ x ) 1 B^_ x . Hence, 
we conclude that, 


rank (A n _i (l n _i -pF n _i - gF*_i) 1 [ e n ] ) = M 


(57) 


C Proof of Lemma |2j 

Since Grank([Hi A],[H 2 B],G 2 ) > Grank (Hi, H 2 , G 2 ), it is clear that, colspan ( 


t 


colspan 


Hi 

H 2 


We first consider the case when at least one of conditions (i) and (ii) does not hold. In this 
case, we pick entries of f uniformly at random from a large enough field. 

1. If colspan (B) E colspan (H 2 ), then Bf E colspan (H 2 ), but Af 0 colspan (Hi). Consequently, 


rank 


Hi Af 
H 2 Bf 


= rank 


Hi 

H 2 


+ 1 > 


rank ([H 2 Bf G 2 ]) = rank ([H 2 G 2 ]) , 
rank([H 2 Bf]) = rank (H 2 ) . 


2. If colspan (B) 0 colspan ([H 2 G 2 ]), then Bf 0 colspan ([H 2 G 2 ]), and Af 0 colspan (Hi) 
Hence, 


rank 


Hi 

H 2 


Af' 

Bf 


= rank 


Hi 

H 2 


+ 1 ! 


rank ([H 2 Bf G 2 ]) = rank ([H 2 G 2 ]) + 1 , 
rank ([H 2 Bf]) = rank (H 2 ) + 1 . 

Therefore, in both cases, we Grank ([Hi Af], [H 2 Bf] , G 2 ) = Grank (Hi, H 2 , G 2 ) + 1. 

It remains to find the vector f that satisfies © when conditions (i) and (ii) both hold. In 
particular, it suffices to find a vector f such that 


f 0 colspan 


Hi 

H 2 

Bf E colspan (H 2 ) 


(58) 

(59) 


Suppose the columns of the matrix 


Hi 

H 2 


are indexed by the index set [Qi] = {1, 2 ,..., Q{\, while 


the columns are from 


are indexed by the index set [M] = {1,2 ,,M}. Let T be the subset 


of [Qi] such that, for the matrix H 2 , its submatrix consists columns indexed by T forms basis of 
the matrix. Denote this submatrix as Hi. Thus, rank(H 2 ) = |T|. By the basis extension theorem, 


there exists a subset of columns of 


, indexed by as C [M], such that the columns of matrix 


[H£ B Qfl ] form a basis of [H 2 B]. Since columns of Hij are linearly independent, columns of 
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H 1 


Hi 

Hi 


are also linearly independent. Hence, there exists a subset an C [Qi], such that 


an PI T = 0, such that the columns of 


Hi 

Hi 


H[ 

H 2 


H^J 


form a basis of 


Hi 

H 2 


Consider the matrix 


H A" s 




B“s 


. We show that its columns are linearly independent. Consider a length |T| 


vector c 1 , a length \an\ vector c“ H and a length \oib\ vector c Qs , such that 


Hi 

Hi 


H a H 

n 2 


A“ s 

B“s 



r„n 


c 


C aH 


C "B 


= 0 . 


( 60 ) 


We aim to show that 


r c r 1 


-OLH 


-.«B 


= 0. Since the columns of is a basis for H 2 , there exists a matrix 


D such that = Hi. Thus, 


[Hi H 


r b° b ] 


r c r 1 






= Hi(c r + Dc“ H ) + B c 


= 0 . 


( 61 ) 


But [Hi B" B ] forms a basis. As a result, c Qs = 0 and c 1 +Dc“ H = 0. Substituting this in (60), 
we get 


Hi Hr - 


'c r ' 

ttT ttH// 

r± 2 r ±2 


c aH 


= 0 . 


( 62 ) 


Since columns of 
of 


h[ Hr 

H r it«h 

2 - h -2 j 


H[ Hr A as 


Hi Hr B Qb 


also form a basis, we must have c 1 = 0 and c° H = 0. Hence, columns 
are linearly independent. 


By the basis extension theorem, there exits a set (5b C 

Hi A 


Hi Hr A“ b A^ s 

Hi Hr B as 


form a basis for the matrix 


Ho B 


M]\ub, such that the columns of 
. We next show that \(5b\ > 0. 


We know that Grank([Hi A], [H 2 B],G 2 ) > Grank (Hi, H 2 , G 2 ). Therefore, 
Grank([Hi A],[H 2 B], G 2 ) - Grank (Hi, H 2 , G 2 ) 


= rank 


Hi A 

H 2 B 


— rank 


Hi 

Ho 


— rank (H 2 ) + rank ([H 2 B]) 


(“) , 

= rank 


Hi A 

H 2 B 


— rank 


Hi 

H 2 


rank ([H 2 B G 2 ]) - rank ([H 2 G 2 ]) 


— (rank (H 2 ) — rank ([H 2 B])) 


= (|T| + \qlh\ + |as| + \(5b\ — |T| - Iasi) - (|T| - |T| + |as|) 

= \(5b\ > 0 , 

where (a) follows from condition (ii), i.e. colspan (B) C colspan ( [H 2 G 2 ]). 
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Since /3 b 7 ^ 0, note that the columns of [Hi, B" b B^ s 1 is a linearly dependent set of 
vectors. This is because, this set of vectors spans [H 2 B] , whose basis is [Hg B“ s ] . The 
nullspace of [HJ[ B“ s B^ s ] therefore contains at least one non-zero vector. Pick an arbitrary 


non-zero vector from this nullspace. Denote the vector as be 


~OtB 

r pB 


where c r is a length |T| 


subvector, c“ s is a length \olb\ subvector and c! is is a length \(3b\ subvector. We have, c^ B / 0 


and Hj’c 1 ' + B as c afl + B^ B c^ B = 0, hence, B“ s c Qs + B^ fl c^ s G colspan (Hl>) = colspan (Ho). On 


the other hand, since columns of 


Hf 

H 2 


H « H 

n 2 


is linearly independent of the columns of 

0 colspan 


A° s 

B“s 

‘Hf 

H 2 


A P* 
BP b 


is a basis, the columns of 


A aB 

B QB 


A? 13 

B Pb 


H aH 

n 2 


which is a basis for 


Hi 

H 2 


Therefore, 


A aB 

B“s 


ms 


+ 


A aB 

B“s 


Hi 

H 2 


We are now ready to specify f satisfying (58) and (59). For any set / C [M], let f 7 denote the 
entries of vector f 1 corresponding to set /. We specify the M x 1 vector f as follows: f" s = c“ s , 
f/3s _ c ps ; anc j = 0 for any set / which is disjoint with as U /3b- We note that with this choice 
of f, we have 


Bf = B" s c“ B + B fe c fe 


f = 


A aB 

B«s 


■.OLB 


+ 


A aB 

B«s 


■,3b 


It can be readily verified that (58) and (59) are satisfied. 


So far, we have showed that there is at least one vector f which satisfies (58) and 


and 


subsequently satisfies Next, we show that if conditions (i) and (ii) hold, then picking a 

random vector v from the nullspace of [H 2 B] and setting f to be the last M entries of the 
random vector v satisfies with high probability,. In particular, we show that such a vector f 


satisfies ( |58j ) and ( p9| ) with high probability, 
vi 

v 2 


Let v = 


, vi G F^ x ,V 2 G ¥ m , be a random vector from the nullspace of [H2 B] , i.e. 


H 2 V 1 + Bv 2 = 0. Clearly, f = V 2 satisfies ( p9| ). We need to show that f chosen this way satisfies 
(58) with high probability. Note that the vector V 2 fails to satisfy (|58[) if and only if there exists 


some v( G F^ 1 , such that, 


Hi A 


vi' 

H 2 B 


_ v 2_ 


= 0 


(63) 


Let 7 Z be the set of all such vectors in the nullspace of [H 2 B 

G Ker ( [H 2 B] ) : 3 vi G W Q \ s.t. 


i.e. 


n = 


Vl 

v 2 


Hi A 

H 2 B 


v 2 


= 0 


(64) 


Clearly, if 


Vl 


_ v 2 

1 


ui 

u 2 


G 1Z, then there exists v), u( G F® 1 , s.t. 


Hi 

A 

avj + bu[ 

h 2 

B 

av 2 + bu 2 


= 0 


(65) 


46 





















































for any a, b £ ¥, i.e. 


€ 1Z . Hence, 1Z is a subspace of Ker ([H 2 B]). 


av'j + bu\ 

av 2 + &U 2 

Furthermore, note that 1Z is a proper subspace. This is because we have already shown that 
there exists a vector in the null space of [H 2 B] that satisfy (58), that is, we have already shown 
the existence of a vector in the nullspace [Ho B] that does not lie in 1Z. Therefore, by picking 
v uniformly at random from Ker ([H 2 B]), the probability that both condition (581 and 
satisfied is 


are 


l-.P(veft)-l-p^ , t G Z + 


( 66 ) 


where |F| is the size of the underlying field and t is the difference between the dimension of subspace 
7 Z and the dimension of Ker ([Ho B]). Hence, as the held size increases arbitrarily, the probability 


of both (581 and (59 1 hold approaches to 1. This completes the proof. 


D Proof of Lemma |9] 


Clearly, at the initial stage, by the construction of the algorithm, every column of the matrix 
corresponds to a source edge and is assigned a unit vector. As a result, none of the columns in the 


matrix 


T 

- 1 ! 


W 


T 

- 1 ! 


(AO 


is all-zero. 


Suppose that there is no all-zero column in the matrix 


all-zero columns can be generated from random coding of columns of 


rn(i + 1)" 

rp(i + l) 

. Clearly, in 

1- 

T'Tt-hT' 

X 

1_ 

J 


L h 2 j 


no 


H 


n 


(;+i)' 


H 2 1 


1 


with a probability 


that tends to 1 as the held size increases. It remains to consider the case when alignment happens. 

die alignment step will generate a column that does not belong to the column 




(A 


Ul 


M 


But by Lemma [2j t 
span of the matrix 
This completes the proof. 

E Proof of Lemma HOl 


Hence, any column generated by alignment cannot be all zero either. 


Suppose that from stage k + 1 to stage k. the algorithm performs some alignment step. Since 
A^ c 0^\ we have Q ^ = T^\T^ = U^' 1 U . By the construction of the algorithm, 

if at some step j in phase 2, an alignment step takes place, then we have, 


colspan H, 1 


rW 


colspan H 
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(k) 


(£ colspan 
C colspan 
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(k) 


o (z) 

r>( fc ) 


jj( k ) 

G^ 2 


l(fc) 


at 
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Since 


c O 


(fc) 


A w 

G*»~‘ 


qW j 

is a submatrix of H 2 J_ 


For the conditions to hold, we must have 



UB^ fc) \!7 1 (fc) 



/o. 


Now consider = T 2 fe 1 \ By the construction of the algorithm, Q^ k ~^ does not 

communicate with T\. Hence, all the columns corresponding to edges in Q^ k ~ 1 ^ are either source 
edge columns or columns generated by random coding in phase 1 at some stage of the algorithm. 
On the other hand, since all edges in Q^ communicate with T -2 but not Tf, for each edge £ Q^ 
there exists an edge e [ k £ Q( k ~^ such that either e^ = ef ^ or is a parent edge of e[ k 
Thus, each column in G^ ( participates in random coding for at least one edge in Q^ k ~ l \ Now 

n(k) Q(fc—1) # Q(k—i) 

since G 2 7 ^ 0, we conclude that at least one column of G 2 is not all-zero and thus G 2 is 

Q(i) 

not a zero matrix. Subsequently we have, for all 0 < i < k, G 2 7 ^ 0, which completes the proof. 
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